Dataset statistics
| Number of variables | 16 |
|---|---|
| Number of observations | 58386 |
| Missing cells | 10999 |
| Missing cells (%) | 1.2% |
| Duplicate rows | 25 |
| Duplicate rows (%) | < 0.1% |
| Total size in memory | 4.0 MiB |
| Average record size in memory | 72.0 B |
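The headline figures can be cross-checked by hand: 10999 missing cells out of 58386 × 16 cells is about 1.2%. The sketch below recomputes the same kind of overview for an arbitrary DataFrame. The function name `dataset_overview` and the toy data are illustrative, and the duplicate count here (repeats of an earlier row) is an assumption; the report does not state which counting rule it uses.

```python
import pandas as pd


def dataset_overview(df: pd.DataFrame) -> dict:
    """Recompute overview-style figures for `df` (hypothetical helper)."""
    n_rows, n_cols = df.shape
    n_cells = n_rows * n_cols
    missing_cells = int(df.isna().sum().sum())
    # duplicated() flags every repeat of an earlier row; the first copy is not counted.
    duplicate_rows = int(df.duplicated().sum())
    return {
        "variables": n_cols,
        "observations": n_rows,
        "missing_cells": missing_cells,
        "missing_cells_pct": 100 * missing_cells / n_cells if n_cells else 0.0,
        "duplicate_rows": duplicate_rows,
        "duplicate_rows_pct": 100 * duplicate_rows / n_rows if n_rows else 0.0,
    }


# Toy frame for illustration only; not the profiled dataset.
toy = pd.DataFrame({"age": [68, 68, 255, 89], "gender": ["F", "F", None, "F"]})
overview = dataset_overview(toy)
print(overview)
```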
Variable types
| Numeric | 8 |
|---|---|
| Categorical | 8 |
Alerts
| Alert | Type |
|---|---|
| Dataset has 25 (< 0.1%) duplicate rows | Duplicates |
| label_1 is highly overall correlated with pred_1 | High correlation |
| label_5 is highly overall correlated with pred_5 | High correlation |
| pred_0 is highly overall correlated with pred_1 and 3 other fields | High correlation |
| pred_1 is highly overall correlated with label_1 and 4 other fields | High correlation |
| pred_2 is highly overall correlated with pred_0 and 3 other fields | High correlation |
| pred_4 is highly overall correlated with pred_0 and 3 other fields | High correlation |
| pred_5 is highly overall correlated with label_5 and 4 other fields | High correlation |
| race is highly imbalanced (52.5%) | Imbalance |
| label_2 is highly imbalanced (59.8%) | Imbalance |
| label_3 is highly imbalanced (85.7%) | Imbalance |
| race has 8160 (14.0%) missing values | Missing |
| gender has 2839 (4.9%) missing values | Missing |
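The imbalance percentages for label_2 and label_3 are consistent with a score of one minus the normalized Shannon entropy of the class counts. That formula is an inference from the numbers in this report, not something the report states. A minimal sketch, with `imbalance_score` as a hypothetical helper name:

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy (bits) of the counts."""
    k = len(counts)
    if k < 2:
        return 0.0  # convention chosen for this sketch: a single class scores 0
    total = sum(counts)
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)


# Class counts taken from the label_2 and label_3 tables in this report.
print(round(imbalance_score([53717, 4669]), 3))  # label_2
print(round(imbalance_score([57202, 1184]), 3))  # label_3
```

Under this assumption the two calls give roughly 0.598 and 0.857, matching the 59.8% and 85.7% shown in the alerts. A perfectly balanced variable scores 0.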
Reproduction
| Analysis started | 2024-08-16 14:19:16.175507 |
|---|---|
| Analysis finished | 2024-08-16 14:19:27.227146 |
| Duration | 11.05 seconds |
| Software version | ydata-profiling v4.9.0 |
| Download configuration | config.json |
subject_id
Real number (ℝ)
| Distinct | 12507 |
|---|---|
| Distinct (%) | 21.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15047124 |
| Minimum | 10001122 |
|---|---|
| Maximum | 19998444 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 456.3 KiB |
Quantile statistics
| Minimum | 10001122 |
|---|---|
| 5-th percentile | 10578325 |
| Q1 | 12601466 |
| median | 15108002 |
| Q3 | 17461126 |
| 95-th percentile | 19453522 |
| Maximum | 19998444 |
| Range | 9997322 |
| Interquartile range (IQR) | 4859660.2 |
Descriptive statistics
| Standard deviation | 2842240.5 |
|---|---|
| Coefficient of variation (CV) | 0.18888928 |
| Kurtosis | -1.1777726 |
| Mean | 15047124 |
| Median Absolute Deviation (MAD) | 2434888 |
| Skewness | -0.029466504 |
| Sum | 8.785414 × 10¹¹ |
| Variance | 8.078331 × 10¹² |
| Monotonicity | Not monotonic |
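Several of these figures follow directly from the basic ones: the range is 19998444 − 10001122 = 9997322, the coefficient of variation is 2842240.5 / 15047124 ≈ 0.1889, and the variance is the square of the standard deviation. The sketch below computes the same derived statistics with NumPy. The helper name `describe_numeric` is illustrative, and the use of the sample standard deviation (`ddof=1`) and linear-interpolated percentiles is an assumption about how the report was produced.

```python
import numpy as np


def describe_numeric(values):
    """Compute a few derived descriptive statistics (hypothetical helper)."""
    x = np.asarray(values, dtype=float)
    q1, median, q3 = np.percentile(x, [25, 50, 75])
    mean = x.mean()
    std = x.std(ddof=1)  # assumption: sample standard deviation
    return {
        "mean": mean,
        "std": std,
        "variance": std ** 2,
        "cv": std / mean,                       # coefficient of variation
        "range": x.max() - x.min(),
        "iqr": q3 - q1,                         # interquartile range
        "mad": np.median(np.abs(x - median)),   # median absolute deviation
    }


# Toy values for illustration only.
stats = describe_numeric([1, 2, 3, 4, 100])
print(stats)
```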
| Value | Count | Frequency (%) |
| 16662316 | 159 | 0.3% |
| 18001923 | 100 | 0.2% |
| 18902344 | 98 | 0.2% |
| 11021643 | 98 | 0.2% |
| 11648387 | 81 | 0.1% |
| 16924675 | 80 | 0.1% |
| 14508231 | 76 | 0.1% |
| 15656571 | 74 | 0.1% |
| 10578325 | 68 | 0.1% |
| 15131736 | 67 | 0.1% |
| Other values (12497) | 57485 |
| Value | Count | Frequency (%) |
| 10001122 | 5 | < 0.1% |
| 10001884 | 51 | |
| 10002013 | 14 | < 0.1% |
| 10002430 | 9 | < 0.1% |
| 10003255 | 2 | < 0.1% |
| 10004720 | 2 | < 0.1% |
| 10005001 | 2 | < 0.1% |
| 10006023 | 2 | < 0.1% |
| 10006501 | 2 | < 0.1% |
| 10008064 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 19998444 | 2 | |
| 19995320 | 3 | |
| 19995258 | 3 | |
| 19995179 | 1 | < 0.1% |
| 19994588 | 4 | |
| 19994233 | 3 | |
| 19991424 | 2 | |
| 19991085 | 1 | < 0.1% |
| 19990545 | 2 | |
| 19990078 | 4 |
age
Real number (ℝ)
| Distinct | 74 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 69.011184 |
| Minimum | 18 |
|---|---|
| Maximum | 255 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 57.1 KiB |
Quantile statistics
| Minimum | 18 |
|---|---|
| 5-th percentile | 26 |
| Q1 | 49 |
| median | 62 |
| Q3 | 75 |
| 95-th percentile | 91 |
| Maximum | 255 |
| Range | 237 |
| Interquartile range (IQR) | 26 |
Descriptive statistics
| Standard deviation | 45.523478 |
|---|---|
| Coefficient of variation (CV) | 0.65965363 |
| Kurtosis | 10.660975 |
| Mean | 69.011184 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | 3.1955872 |
| Sum | 4029287 |
| Variance | 2072.387 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 255 | 2839 | 4.9% |
| 91 | 1484 | 2.5% |
| 62 | 1356 | 2.3% |
| 63 | 1325 | 2.3% |
| 52 | 1296 | 2.2% |
| 72 | 1248 | 2.1% |
| 59 | 1236 | 2.1% |
| 64 | 1235 | 2.1% |
| 57 | 1229 | 2.1% |
| 54 | 1219 | 2.1% |
| Other values (64) | 43919 |
| Value | Count | Frequency (%) |
| 18 | 212 | |
| 19 | 311 | |
| 20 | 409 | |
| 21 | 291 | |
| 22 | 355 | |
| 23 | 421 | |
| 24 | 325 | |
| 25 | 359 | |
| 26 | 390 | |
| 27 | 383 |
| Value | Count | Frequency (%) |
| 255 | 2839 | |
| 91 | 1484 | |
| 89 | 265 | 0.5% |
| 88 | 538 | 0.9% |
| 87 | 512 | 0.9% |
| 86 | 540 | 0.9% |
| 85 | 685 | 1.2% |
| 84 | 695 | 1.2% |
| 83 | 707 | 1.2% |
| 82 | 724 | 1.2% |
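The value 255 occurs 2839 times, which equals the number of missing gender values, and the sample rows with age 255 also have NaN for race and gender. This suggests, though the report does not confirm it, that 255 is a placeholder for an unknown age rather than a real measurement. If so, it inflates the mean (69.0 against a median of 62), the standard deviation, and the skewness. A minimal sketch of masking it before computing statistics, where `AGE_SENTINEL` and `clean_age` are illustrative names:

```python
import pandas as pd

AGE_SENTINEL = 255  # assumption: 255 encodes "age unknown"


def clean_age(age: pd.Series, sentinel: int = AGE_SENTINEL) -> pd.Series:
    """Replace the sentinel with NaN so it no longer skews the statistics."""
    return age.astype("float64").mask(age == sentinel)


# Toy values for illustration only.
toy_age = pd.Series([68, 255, 89, 255, 46])
cleaned = clean_age(toy_age)
print(toy_age.mean(), cleaned.mean())
```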
race
Categorical
IMBALANCE  MISSING 
| Distinct | 33 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 8160 |
| Missing (%) | 14.0% |
| Memory size | 58.5 KiB |
| WHITE | 29729 |
|---|---|
| BLACK/AFRICAN AMERICAN | 8413 |
| OTHER | 1638 |
| HISPANIC/LATINO - PUERTO RICAN | 1164 |
| WHITE - OTHER EUROPEAN | 1156 |
| Other values (28) | 8126 |
Length
| Max length | 41 |
|---|---|
| Median length | 5 |
| Mean length | 10.827619 |
| Min length | 5 |
Characters and Unicode
| Total characters | 543828 |
|---|---|
| Distinct characters | 27 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | WHITE |
|---|---|
| 2nd row | WHITE |
| 3rd row | WHITE |
| 4th row | WHITE |
| 5th row | BLACK/AFRICAN AMERICAN |
Common Values
| Value | Count | Frequency (%) |
| WHITE | 29729 | 50.9% |
| BLACK/AFRICAN AMERICAN | 8413 | 14.4% |
| OTHER | 1638 | 2.8% |
| HISPANIC/LATINO - PUERTO RICAN | 1164 | 2.0% |
| WHITE - OTHER EUROPEAN | 1156 | 2.0% |
| WHITE - RUSSIAN | 923 | 1.6% |
| HISPANIC OR LATINO | 815 | 1.4% |
| UNKNOWN | 789 | 1.4% |
| ASIAN - CHINESE | 701 | 1.2% |
| BLACK/CAPE VERDEAN | 683 | 1.2% |
| Other values (23) | 4215 | 7.2% |
| (Missing) | 8160 | 14.0% |
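The frequencies in this table are relative to all 58386 rows, missing values included (for example 8413 / 58386 ≈ 14.4%). A table of the same shape can be sketched with pandas as follows. The helper name `common_values` and the toy data are illustrative, not part of the report.

```python
import pandas as pd


def common_values(series: pd.Series, top_n: int = 10) -> pd.DataFrame:
    """Top-N value counts plus 'Other values' and '(Missing)' rows (hypothetical helper)."""
    counts = series.value_counts(dropna=True)  # sorted by count, descending
    rows = [(str(value), int(count)) for value, count in counts.head(top_n).items()]
    rest = counts.iloc[top_n:]
    if len(rest):
        rows.append((f"Other values ({len(rest)})", int(rest.sum())))
    n_missing = int(series.isna().sum())
    if n_missing:
        rows.append(("(Missing)", n_missing))
    table = pd.DataFrame(rows, columns=["Value", "Count"])
    # Percentages use the full row count, so missing rows are part of the denominator.
    table["Frequency (%)"] = (100 * table["Count"] / len(series)).round(1)
    return table


# Toy values for illustration only.
toy_race = pd.Series(["WHITE", "WHITE", "WHITE", "OTHER", "UNKNOWN", None])
table = common_values(toy_race, top_n=1)
print(table)
```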
Length
| Value | Count | Frequency (%) |
| white | 32120 | 41.4% |
| black/african | 8757 | 11.3% |
| american | 8735 | 11.2% |
| | 6012 | 7.7% |
| other | 2834 | 3.6% |
| hispanic/latino | 2520 | 3.2% |
| asian | 2054 | 2.6% |
| european | 1326 | 1.7% |
| rican | 1164 | 1.5% |
| puerto | 1164 | 1.5% |
| Other values (38) | 10986 | 14.1% |
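Tokens such as `black/african` and `rican` suggest that the values were lowercased and split on whitespace only, so a slash stays inside a token. That tokenization rule is an assumption drawn from the table, not a documented behavior. A minimal sketch using only the standard library, with `word_counts` as an illustrative name:

```python
from collections import Counter


def word_counts(values):
    """Count lowercase, whitespace-separated tokens, skipping missing values."""
    counter = Counter()
    for value in values:
        if value is None:
            continue
        counter.update(value.lower().split())
    return counter


# Toy values for illustration only.
counts = word_counts(
    ["WHITE", "BLACK/AFRICAN AMERICAN", "WHITE - OTHER EUROPEAN", None]
)
print(counts.most_common(3))
```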
Most occurring characters
| Value | Count | Frequency (%) |
| I | 68629 | 12.6% |
| A | 66207 | 12.2% |
| E | 53758 | 9.9% |
| T | 41519 | 7.6% |
| H | 39438 | 7.3% |
| N | 38150 | 7.0% |
| C | 35052 | 6.4% |
| W | 33037 | 6.1% |
| R | 27940 | 5.1% |
| | 27446 | 5.0% |
| Other values (17) | 112652 | 20.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 543828 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| I | 68629 | |
| A | 66207 | |
| E | 53758 | |
| T | 41519 | 7.6% |
| H | 39438 | 7.3% |
| N | 38150 | 7.0% |
| C | 35052 | 6.4% |
| W | 33037 | 6.1% |
| R | 27940 | 5.1% |
| | 27446 | 5.0% |
| Other values (17) | 112652 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 543828 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| I | 68629 | |
| A | 66207 | |
| E | 53758 | |
| T | 41519 | 7.6% |
| H | 39438 | 7.3% |
| N | 38150 | 7.0% |
| C | 35052 | 6.4% |
| W | 33037 | 6.1% |
| R | 27940 | 5.1% |
| | 27446 | 5.0% |
| Other values (17) | 112652 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 543828 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| I | 68629 | |
| A | 66207 | |
| E | 53758 | |
| T | 41519 | 7.6% |
| H | 39438 | 7.3% |
| N | 38150 | 7.0% |
| C | 35052 | 6.4% |
| W | 33037 | 6.1% |
| R | 27940 | 5.1% |
| | 27446 | 5.0% |
| Other values (17) | 112652 |
gender
Categorical
MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2839 |
| Missing (%) | 4.9% |
| Memory size | 456.3 KiB |
| M | 28144 |
|---|---|
| F | 27403 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 55547 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | F |
|---|---|
| 2nd row | F |
| 3rd row | F |
| 4th row | F |
| 5th row | F |
Common Values
| Value | Count | Frequency (%) |
| M | 28144 | 48.2% |
| F | 27403 | 46.9% |
| (Missing) | 2839 | 4.9% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| m | 28144 | |
| f | 27403 |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 28144 | |
| F | 27403 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 55547 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| M | 28144 | |
| F | 27403 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 55547 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| M | 28144 | |
| F | 27403 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 55547 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| M | 28144 | |
| F | 27403 |
pred_0
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 58297 |
|---|---|
| Distinct (%) | 99.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.22864699 |
| Minimum | 0.0007134835 |
|---|---|
| Maximum | 0.93613154 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 456.3 KiB |
Quantile statistics
| Minimum | 0.0007134835 |
|---|---|
| 5-th percentile | 0.019512462 |
| Q1 | 0.076932167 |
| median | 0.18095325 |
| Q3 | 0.34323808 |
| 95-th percentile | 0.59315564 |
| Maximum | 0.93613154 |
| Range | 0.93541806 |
| Interquartile range (IQR) | 0.26630592 |
Descriptive statistics
| Standard deviation | 0.18267495 |
|---|---|
| Coefficient of variation (CV) | 0.79893878 |
| Kurtosis | 0.10155325 |
| Mean | 0.22864699 |
| Median Absolute Deviation (MAD) | 0.1209184 |
| Skewness | 0.90499551 |
| Sum | 13349.783 |
| Variance | 0.033370136 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.11780363 | 2 | < 0.1% |
| 0.24564247 | 2 | < 0.1% |
| 0.23545875 | 2 | < 0.1% |
| 0.11404432 | 2 | < 0.1% |
| 0.08073465 | 2 | < 0.1% |
| 0.28054523 | 2 | < 0.1% |
| 0.19426298 | 2 | < 0.1% |
| 0.17781691 | 2 | < 0.1% |
| 0.2347063 | 2 | < 0.1% |
| 0.4517402 | 2 | < 0.1% |
| Other values (58287) | 58366 |
| Value | Count | Frequency (%) |
| 0.0007134835 | 1 | |
| 0.00071961153 | 1 | |
| 0.00081179786 | 1 | |
| 0.00089570356 | 1 | |
| 0.00090980896 | 1 | |
| 0.0009669827 | 1 | |
| 0.0009757418 | 1 | |
| 0.0009865833 | 1 | |
| 0.0011006723 | 1 | |
| 0.0012743742 | 1 |
| Value | Count | Frequency (%) |
| 0.93613154 | 1 | |
| 0.93382 | 1 | |
| 0.9289067 | 1 | |
| 0.9256189 | 1 | |
| 0.9232685 | 1 | |
| 0.92095166 | 1 | |
| 0.9200051 | 1 | |
| 0.9191345 | 1 | |
| 0.90677124 | 1 | |
| 0.9062864 | 1 |
label_0
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 456.3 KiB |
| 0 | 51003 |
|---|---|
| 1 | 7383 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 58386 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 51003 | 87.4% |
| 1 | 7383 | 12.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 51003 | |
| 1 | 7383 | 12.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 51003 | |
| 1 | 7383 | 12.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 58386 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 51003 | |
| 1 | 7383 | 12.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 58386 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 51003 | |
| 1 | 7383 | 12.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 58386 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 51003 | |
| 1 | 7383 | 12.6% |
pred_1
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 58285 |
|---|---|
| Distinct (%) | 99.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.14381424 |
| Minimum | 4.4961896 × 10⁻⁵ |
|---|---|
| Maximum | 0.99647397 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 456.3 KiB |
Quantile statistics
| Minimum | 4.4961896 × 10⁻⁵ |
|---|---|
| 5-th percentile | 0.0019736528 |
| Q1 | 0.011122281 |
| median | 0.044808384 |
| Q3 | 0.1856508 |
| 95-th percentile | 0.64826539 |
| Maximum | 0.99647397 |
| Range | 0.99642901 |
| Interquartile range (IQR) | 0.17452852 |
Descriptive statistics
| Standard deviation | 0.20828639 |
|---|---|
| Coefficient of variation (CV) | 1.4483016 |
| Kurtosis | 3.0725192 |
| Mean | 0.14381424 |
| Median Absolute Deviation (MAD) | 0.040527415 |
| Skewness | 1.926109 |
| Sum | 8396.7382 |
| Variance | 0.043383221 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.056017026 | 2 | < 0.1% |
| 0.06861152 | 2 | < 0.1% |
| 0.07065508 | 2 | < 0.1% |
| 0.061210033 | 2 | < 0.1% |
| 0.0425446 | 2 | < 0.1% |
| 0.46214718 | 2 | < 0.1% |
| 0.031990286 | 2 | < 0.1% |
| 0.019157823 | 2 | < 0.1% |
| 0.013577934 | 2 | < 0.1% |
| 0.31736004 | 2 | < 0.1% |
| Other values (58275) | 58366 |
| Value | Count | Frequency (%) |
| 4.4961896 × 10⁻⁵ | 1 | |
| 4.5468667 × 10⁻⁵ | 1 | |
| 5.226288 × 10⁻⁵ | 1 | |
| 5.640238 × 10⁻⁵ | 1 | |
| 7.616898 × 10⁻⁵ | 1 | |
| 7.8047895 × 10⁻⁵ | 1 | |
| 8.106019 × 10⁻⁵ | 1 | |
| 8.346491 × 10⁻⁵ | 1 | |
| 8.3913335 × 10⁻⁵ | 1 | |
| 8.4583764 × 10⁻⁵ | 1 |
| Value | Count | Frequency (%) |
| 0.99647397 | 1 | |
| 0.9950956 | 1 | |
| 0.9932274 | 1 | |
| 0.99279433 | 1 | |
| 0.99267393 | 1 | |
| 0.992002 | 1 | |
| 0.9901102 | 1 | |
| 0.9889313 | 1 | |
| 0.9884332 | 1 | |
| 0.9871696 | 1 |
label_1
Categorical
HIGH CORRELATION 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 456.3 KiB |
| 0 | 49578 |
|---|---|
| 1 | 8808 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 58386 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 49578 | 84.9% |
| 1 | 8808 | 15.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 49578 | |
| 1 | 8808 | 15.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 49578 | |
| 1 | 8808 | 15.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 58386 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 49578 | |
| 1 | 8808 | 15.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 58386 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 49578 | |
| 1 | 8808 | 15.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 58386 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 49578 | |
| 1 | 8808 | 15.1% |
pred_2
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 58258 |
|---|---|
| Distinct (%) | 99.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.02329472 |
| Minimum | 3.1557033 × 10⁻⁶ |
|---|---|
| Maximum | 0.74713737 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 456.3 KiB |
Quantile statistics
| Minimum | 3.1557033 × 10⁻⁶ |
|---|---|
| 5-th percentile | 0.00015165282 |
| Q1 | 0.001014794 |
| median | 0.0043468919 |
| Q3 | 0.019601202 |
| 95-th percentile | 0.11696669 |
| Maximum | 0.74713737 |
| Range | 0.74713421 |
| Interquartile range (IQR) | 0.018586408 |
Descriptive statistics
| Standard deviation | 0.051255015 |
|---|---|
| Coefficient of variation (CV) | 2.2002847 |
| Kurtosis | 27.029555 |
| Mean | 0.02329472 |
| Median Absolute Deviation (MAD) | 0.0039987507 |
| Skewness | 4.4771904 |
| Sum | 1360.0855 |
| Variance | 0.0026270766 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.011892402 | 2 | < 0.1% |
| 0.018520292 | 2 | < 0.1% |
| 0.01694852 | 2 | < 0.1% |
| 0.005958382 | 2 | < 0.1% |
| 0.0034950266 | 2 | < 0.1% |
| 0.0023882305 | 2 | < 0.1% |
| 0.00014194647 | 2 | < 0.1% |
| 0.011995183 | 2 | < 0.1% |
| 0.0008231159 | 2 | < 0.1% |
| 0.00318263 | 2 | < 0.1% |
| Other values (58248) | 58366 |
| Value | Count | Frequency (%) |
| 3.1557033 × 10⁻⁶ | 1 | |
| 3.6686322 × 10⁻⁶ | 1 | |
| 4.1523763 × 10⁻⁶ | 1 | |
| 4.6425284 × 10⁻⁶ | 1 | |
| 5.1734055 × 10⁻⁶ | 1 | |
| 5.183856 × 10⁻⁶ | 1 | |
| 5.324474 × 10⁻⁶ | 1 | |
| 5.3583226 × 10⁻⁶ | 1 | |
| 5.3898316 × 10⁻⁶ | 1 | |
| 5.4543116 × 10⁻⁶ | 1 |
| Value | Count | Frequency (%) |
| 0.74713737 | 1 | |
| 0.68075085 | 1 | |
| 0.63492453 | 1 | |
| 0.62049097 | 1 | |
| 0.61003417 | 1 | |
| 0.60766214 | 1 | |
| 0.60516906 | 1 | |
| 0.5874526 | 1 | |
| 0.58547086 | 1 | |
| 0.58512235 | 1 |
label_2
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 456.3 KiB |
| 0 | 53717 |
|---|---|
| 1 | 4669 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 58386 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 53717 | 92.0% |
| 1 | 4669 | 8.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 53717 | |
| 1 | 4669 | 8.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 53717 | |
| 1 | 4669 | 8.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 58386 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 53717 | |
| 1 | 4669 | 8.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 58386 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 53717 | |
| 1 | 4669 | 8.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 58386 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 53717 | |
| 1 | 4669 | 8.0% |
pred_3
Real number (ℝ)
| Distinct | 58084 |
|---|---|
| Distinct (%) | 99.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.0077101552 |
| Minimum | 0.00012598517 |
|---|---|
| Maximum | 0.19338268 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 456.3 KiB |
Quantile statistics
| Minimum | 0.00012598517 |
|---|---|
| 5-th percentile | 0.0015437935 |
| Q1 | 0.0034136534 |
| median | 0.005793022 |
| Q3 | 0.0096697836 |
| 95-th percentile | 0.019977439 |
| Maximum | 0.19338268 |
| Range | 0.19325669 |
| Interquartile range (IQR) | 0.0062561303 |
Descriptive statistics
| Standard deviation | 0.0071315162 |
|---|---|
| Coefficient of variation (CV) | 0.92495104 |
| Kurtosis | 41.921935 |
| Mean | 0.0077101552 |
| Median Absolute Deviation (MAD) | 0.0028272582 |
| Skewness | 4.2254181 |
| Sum | 450.16512 |
| Variance | 5.0858523 × 10⁻⁵ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.007883686 | 3 | < 0.1% |
| 0.009535172 | 2 | < 0.1% |
| 0.0020297398 | 2 | < 0.1% |
| 0.005581074 | 2 | < 0.1% |
| 0.011066267 | 2 | < 0.1% |
| 0.0074752197 | 2 | < 0.1% |
| 0.0048888237 | 2 | < 0.1% |
| 0.008028185 | 2 | < 0.1% |
| 0.0037553138 | 2 | < 0.1% |
| 0.013138089 | 2 | < 0.1% |
| Other values (58074) | 58365 |
| Value | Count | Frequency (%) |
| 0.00012598517 | 1 | |
| 0.00013584696 | 1 | |
| 0.0001408542 | 1 | |
| 0.00015250737 | 1 | |
| 0.00015432482 | 1 | |
| 0.0001975599 | 1 | |
| 0.00019824866 | 1 | |
| 0.00021555979 | 1 | |
| 0.00022349285 | 1 | |
| 0.00023816832 | 1 |
| Value | Count | Frequency (%) |
| 0.19338268 | 1 | |
| 0.17355661 | 1 | |
| 0.137751 | 1 | |
| 0.13011244 | 1 | |
| 0.12925966 | 1 | |
| 0.1275461 | 1 | |
| 0.11823893 | 1 | |
| 0.111909024 | 1 | |
| 0.11189618 | 1 | |
| 0.11088168 | 1 |
label_3
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 456.3 KiB |
| 0 | 57202 |
|---|---|
| 1 | 1184 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 58386 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 57202 | 98.0% |
| 1 | 1184 | 2.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 57202 | |
| 1 | 1184 | 2.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 57202 | |
| 1 | 1184 | 2.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 58386 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 57202 | |
| 1 | 1184 | 2.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 58386 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 57202 | |
| 1 | 1184 | 2.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 58386 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 57202 | |
| 1 | 1184 | 2.0% |
pred_4
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 58263 |
|---|---|
| Distinct (%) | 99.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.2211176 |
| Minimum | 0.0038607 |
|---|---|
| Maximum | 0.93400216 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 456.3 KiB |
Quantile statistics
| Minimum | 0.0038607 |
|---|---|
| 5-th percentile | 0.040009527 |
| Q1 | 0.098113671 |
| median | 0.18470399 |
| Q3 | 0.31074162 |
| 95-th percentile | 0.52296796 |
| Maximum | 0.93400216 |
| Range | 0.93014146 |
| Interquartile range (IQR) | 0.21262795 |
Descriptive statistics
| Standard deviation | 0.1541044 |
|---|---|
| Coefficient of variation (CV) | 0.69693411 |
| Kurtosis | 0.68908138 |
| Mean | 0.2211176 |
| Median Absolute Deviation (MAD) | 0.099361412 |
| Skewness | 1.0124967 |
| Sum | 12910.172 |
| Variance | 0.023748166 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.33021012 | 2 | < 0.1% |
| 0.12056588 | 2 | < 0.1% |
| 0.27612534 | 2 | < 0.1% |
| 0.22770756 | 2 | < 0.1% |
| 0.22043061 | 2 | < 0.1% |
| 0.13232286 | 2 | < 0.1% |
| 0.056873277 | 2 | < 0.1% |
| 0.06956278 | 2 | < 0.1% |
| 0.16752589 | 2 | < 0.1% |
| 0.20707266 | 2 | < 0.1% |
| Other values (58253) | 58366 |
| Value | Count | Frequency (%) |
| 0.0038607 | 1 | |
| 0.0063759116 | 1 | |
| 0.006380804 | 1 | |
| 0.006406149 | 1 | |
| 0.0067538884 | 1 | |
| 0.0068781367 | 1 | |
| 0.007397928 | 1 | |
| 0.007492071 | 1 | |
| 0.007721727 | 1 | |
| 0.007755888 | 1 |
| Value | Count | Frequency (%) |
| 0.93400216 | 1 | |
| 0.9310748 | 1 | |
| 0.9279111 | 1 | |
| 0.9268072 | 1 | |
| 0.919865 | 1 | |
| 0.91918445 | 1 | |
| 0.9060331 | 1 | |
| 0.90261084 | 1 | |
| 0.90051305 | 1 | |
| 0.89736754 | 1 |
label_4
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 456.3 KiB |
| 0 | 44761 |
|---|---|
| 1 | 13625 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 58386 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 44761 | 76.7% |
| 1 | 13625 | 23.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 44761 | |
| 1 | 13625 | 23.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 44761 | |
| 1 | 13625 | 23.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 58386 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 44761 | |
| 1 | 13625 | 23.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 58386 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 44761 | |
| 1 | 13625 | 23.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 58386 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 44761 | |
| 1 | 13625 | 23.3% |
pred_5
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 58251 |
|---|---|
| Distinct (%) | 99.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.48817325 |
| Minimum | 0.002602684 |
|---|---|
| Maximum | 0.99169874 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 456.3 KiB |
Quantile statistics
| Minimum | 0.002602684 |
|---|---|
| 5-th percentile | 0.061654868 |
| Q1 | 0.21790319 |
| median | 0.48550943 |
| Q3 | 0.75545749 |
| 95-th percentile | 0.92399838 |
| Maximum | 0.99169874 |
| Range | 0.98909606 |
| Interquartile range (IQR) | 0.5375543 |
Descriptive statistics
| Standard deviation | 0.29015098 |
|---|---|
| Coefficient of variation (CV) | 0.59436068 |
| Kurtosis | -1.3465462 |
| Mean | 0.48817325 |
| Median Absolute Deviation (MAD) | 0.2688394 |
| Skewness | 0.020961991 |
| Sum | 28502.483 |
| Variance | 0.084187594 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.44168693 | 3 | < 0.1% |
| 0.517416 | 2 | < 0.1% |
| 0.5902283 | 2 | < 0.1% |
| 0.4765694 | 2 | < 0.1% |
| 0.76737803 | 2 | < 0.1% |
| 0.82156795 | 2 | < 0.1% |
| 0.86065304 | 2 | < 0.1% |
| 0.36439192 | 2 | < 0.1% |
| 0.46140862 | 2 | < 0.1% |
| 0.14729002 | 2 | < 0.1% |
| Other values (58241) | 58365 |
| Value | Count | Frequency (%) |
| 0.002602684 | 1 | |
| 0.0026368657 | 1 | |
| 0.0027910196 | 1 | |
| 0.0035510373 | 1 | |
| 0.0037912827 | 1 | |
| 0.0043345774 | 1 | |
| 0.004630072 | 1 | |
| 0.005466228 | 1 | |
| 0.005474602 | 1 | |
| 0.005476258 | 1 |
| Value | Count | Frequency (%) |
| 0.99169874 | 1 | |
| 0.99030745 | 1 | |
| 0.9897059 | 1 | |
| 0.98965704 | 1 | |
| 0.9894952 | 1 | |
| 0.989023 | 1 | |
| 0.98899287 | 1 | |
| 0.98837215 | 1 | |
| 0.9880542 | 1 | |
| 0.98787886 | 1 |
label_5
Categorical
HIGH CORRELATION 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 456.3 KiB |
| 0 | 32418 |
|---|---|
| 1 | 25968 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 58386 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 32418 | 55.5% |
| 1 | 25968 | 44.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 32418 | |
| 1 | 25968 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 32418 | |
| 1 | 25968 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 58386 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 32418 | |
| 1 | 25968 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 58386 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 32418 | |
| 1 | 25968 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 58386 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 32418 | |
| 1 | 25968 |
| | age | gender | label_0 | label_1 | label_2 | label_3 | label_4 | label_5 | pred_0 | pred_1 | pred_2 | pred_3 | pred_4 | pred_5 | race | subject_id |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| age | 1.000 | 0.080 | 0.150 | 0.162 | 0.142 | 0.025 | 0.124 | 0.264 | 0.376 | 0.396 | 0.374 | 0.229 | 0.364 | -0.443 | 0.151 | 0.009 |
| gender | 0.080 | 1.000 | 0.009 | 0.025 | 0.000 | 0.011 | 0.045 | 0.056 | 0.079 | 0.041 | 0.009 | 0.044 | 0.077 | 0.074 | 0.155 | 0.033 |
| label_0 | 0.150 | 0.009 | 1.000 | 0.167 | 0.199 | 0.007 | 0.050 | 0.340 | 0.316 | 0.205 | 0.167 | 0.051 | 0.199 | 0.265 | 0.065 | 0.026 |
| label_1 | 0.162 | 0.025 | 0.167 | 1.000 | 0.238 | 0.009 | 0.160 | 0.377 | 0.293 | 0.541 | 0.226 | 0.067 | 0.372 | 0.472 | 0.121 | 0.023 |
| label_2 | 0.142 | 0.000 | 0.199 | 0.238 | 1.000 | 0.016 | 0.092 | 0.264 | 0.291 | 0.278 | 0.371 | 0.016 | 0.285 | 0.328 | 0.066 | 0.031 |
| label_3 | 0.025 | 0.011 | 0.007 | 0.009 | 0.016 | 1.000 | 0.015 | 0.129 | 0.010 | 0.022 | 0.000 | 0.041 | 0.026 | 0.031 | 0.051 | 0.018 |
| label_4 | 0.124 | 0.045 | 0.050 | 0.160 | 0.092 | 0.015 | 1.000 | 0.494 | 0.184 | 0.243 | 0.147 | 0.021 | 0.312 | 0.298 | 0.081 | 0.024 |
| label_5 | 0.264 | 0.056 | 0.340 | 0.377 | 0.264 | 0.129 | 0.494 | 1.000 | 0.385 | 0.424 | 0.213 | 0.086 | 0.470 | 0.527 | 0.129 | 0.028 |
| pred_0 | 0.376 | 0.079 | 0.316 | 0.293 | 0.291 | 0.010 | 0.184 | 0.385 | 1.000 | 0.647 | 0.770 | 0.223 | 0.581 | -0.744 | 0.039 | 0.005 |
| pred_1 | 0.396 | 0.041 | 0.205 | 0.541 | 0.278 | 0.022 | 0.243 | 0.424 | 0.647 | 1.000 | 0.716 | 0.265 | 0.790 | -0.903 | 0.056 | 0.011 |
| pred_2 | 0.374 | 0.009 | 0.167 | 0.226 | 0.371 | 0.000 | 0.147 | 0.213 | 0.770 | 0.716 | 1.000 | 0.242 | 0.707 | -0.776 | 0.022 | 0.016 |
| pred_3 | 0.229 | 0.044 | 0.051 | 0.067 | 0.016 | 0.041 | 0.021 | 0.086 | 0.223 | 0.265 | 0.242 | 1.000 | 0.211 | -0.323 | 0.020 | 0.011 |
| pred_4 | 0.364 | 0.077 | 0.199 | 0.372 | 0.285 | 0.026 | 0.312 | 0.470 | 0.581 | 0.790 | 0.707 | 0.211 | 1.000 | -0.883 | 0.052 | 0.012 |
| pred_5 | -0.443 | 0.074 | 0.265 | 0.472 | 0.328 | 0.031 | 0.298 | 0.527 | -0.744 | -0.903 | -0.776 | -0.323 | -0.883 | 1.000 | 0.065 | -0.014 |
| race | 0.151 | 0.155 | 0.065 | 0.121 | 0.066 | 0.051 | 0.081 | 0.129 | 0.039 | 0.056 | 0.022 | 0.020 | 0.052 | 0.065 | 1.000 | 0.096 |
| subject_id | 0.009 | 0.033 | 0.026 | 0.023 | 0.031 | 0.018 | 0.024 | 0.028 | 0.005 | 0.011 | 0.016 | 0.011 | 0.012 | -0.014 | 0.096 | 1.000 |
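The High correlation alerts in this report are consistent with flagging every pair whose absolute coefficient is at least 0.5: label_1 and pred_1 (0.541) are flagged, while label_4 and label_5 (0.494) are not. The 0.5 threshold is inferred from the matrix rather than stated in the report. The sketch below lists flagged partners for each variable. The helper name `high_correlations` is illustrative, and the small matrix reuses four rows and columns from the table above.

```python
import pandas as pd


def high_correlations(corr: pd.DataFrame, threshold: float = 0.5) -> dict:
    """Map each variable to the other variables with |coefficient| >= threshold."""
    flagged = {}
    for column in corr.columns:
        row = corr.loc[column].drop(labels=column)  # ignore the diagonal
        partners = row[row.abs() >= threshold].index.tolist()
        if partners:
            flagged[column] = partners
    return flagged


# Subset of the correlation matrix above (label_1, pred_1, pred_3, pred_5).
names = ["label_1", "pred_1", "pred_3", "pred_5"]
corr = pd.DataFrame(
    [
        [1.000, 0.541, 0.067, 0.472],
        [0.541, 1.000, 0.265, -0.903],
        [0.067, 0.265, 1.000, -0.323],
        [0.472, -0.903, -0.323, 1.000],
    ],
    index=names,
    columns=names,
)
flagged = high_correlations(corr)
print(flagged)
```

Taking the absolute value matters here because pred_5 is strongly negatively correlated with the other prediction columns.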
| | subject_id | age | race | gender | pred_0 | label_0 | pred_1 | label_1 | pred_2 | label_2 | pred_3 | label_3 | pred_4 | label_4 | pred_5 | label_5 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 11812752 | 68 | WHITE | F | 0.056143 | 0 | 0.015012 | 0 | 0.001175 | 0 | 0.004269 | 0 | 0.091285 | 0 | 0.759478 | 1 |
| 1 | 11812752 | 68 | WHITE | F | 0.051060 | 0 | 0.006028 | 0 | 0.001799 | 0 | 0.006515 | 0 | 0.050387 | 0 | 0.863849 | 1 |
| 2 | 11812752 | 68 | WHITE | F | 0.021372 | 0 | 0.004059 | 0 | 0.000743 | 0 | 0.004475 | 0 | 0.092967 | 0 | 0.849508 | 1 |
| 3 | 11812752 | 68 | WHITE | F | 0.031842 | 0 | 0.001862 | 0 | 0.000797 | 0 | 0.006196 | 0 | 0.047191 | 0 | 0.934808 | 1 |
| 4 | 15197921 | 255 | NaN | NaN | 0.034598 | 0 | 0.010697 | 0 | 0.000503 | 0 | 0.005243 | 0 | 0.077575 | 0 | 0.871780 | 1 |
| 5 | 15197921 | 255 | NaN | NaN | 0.032936 | 0 | 0.001877 | 0 | 0.000490 | 0 | 0.001382 | 0 | 0.077464 | 0 | 0.905316 | 1 |
| 6 | 15264766 | 89 | BLACK/AFRICAN AMERICAN | F | 0.671896 | 1 | 0.493776 | 1 | 0.267089 | 1 | 0.004971 | 0 | 0.282813 | 0 | 0.103194 | 0 |
| 7 | 15264766 | 89 | BLACK/AFRICAN AMERICAN | F | 0.814109 | 1 | 0.502852 | 1 | 0.461416 | 1 | 0.011264 | 0 | 0.310682 | 0 | 0.067107 | 0 |
| 8 | 15264766 | 89 | BLACK/AFRICAN AMERICAN | F | 0.303454 | 0 | 0.521768 | 1 | 0.107275 | 0 | 0.004218 | 0 | 0.492928 | 1 | 0.111324 | 0 |
| 9 | 15264766 | 89 | BLACK/AFRICAN AMERICAN | F | 0.361869 | 0 | 0.137940 | 1 | 0.014426 | 0 | 0.003084 | 0 | 0.212341 | 1 | 0.343258 | 0 |
Last rows
| | subject_id | age | race | gender | pred_0 | label_0 | pred_1 | label_1 | pred_2 | label_2 | pred_3 | label_3 | pred_4 | label_4 | pred_5 | label_5 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 58376 | 17790542 | 59 | UNKNOWN | F | 0.074976 | 0 | 0.004913 | 0 | 0.000395 | 0 | 0.001372 | 0 | 0.061323 | 0 | 0.891654 | 1 |
| 58377 | 17790542 | 59 | UNKNOWN | F | 0.182511 | 0 | 0.013124 | 0 | 0.004613 | 0 | 0.002974 | 0 | 0.091015 | 0 | 0.690288 | 1 |
| 58378 | 10271316 | 255 | NaN | NaN | 0.551669 | 0 | 0.583294 | 0 | 0.212700 | 0 | 0.001746 | 0 | 0.588206 | 0 | 0.030887 | 0 |
| 58379 | 10271316 | 255 | NaN | NaN | 0.236507 | 0 | 0.567540 | 0 | 0.042170 | 0 | 0.004201 | 0 | 0.273062 | 0 | 0.155288 | 0 |
| 58380 | 11183154 | 69 | WHITE | M | 0.068156 | 0 | 0.019276 | 0 | 0.000471 | 0 | 0.008284 | 0 | 0.110424 | 0 | 0.752996 | 1 |
| 58381 | 11183154 | 69 | WHITE | M | 0.104890 | 0 | 0.011576 | 0 | 0.001189 | 0 | 0.003390 | 0 | 0.057218 | 0 | 0.851656 | 1 |
| 58382 | 16736626 | 46 | WHITE | M | 0.099216 | 0 | 0.071405 | 0 | 0.004492 | 0 | 0.001722 | 0 | 0.200777 | 0 | 0.613287 | 1 |
| 58383 | 16736626 | 46 | WHITE | M | 0.439834 | 0 | 0.023786 | 0 | 0.000858 | 0 | 0.007120 | 0 | 0.112077 | 0 | 0.549323 | 1 |
| 58384 | 16736626 | 46 | WHITE | M | 0.446967 | 0 | 0.284390 | 0 | 0.009412 | 0 | 0.022791 | 0 | 0.294759 | 0 | 0.150314 | 0 |
| 58385 | 16736626 | 46 | WHITE | M | 0.430079 | 0 | 0.484221 | 0 | 0.052301 | 1 | 0.011312 | 0 | 0.243240 | 0 | 0.158633 | 0 |
Most frequently occurring
| | subject_id | age | race | gender | pred_0 | label_0 | pred_1 | label_1 | pred_2 | label_2 | pred_3 | label_3 | pred_4 | label_4 | pred_5 | label_5 | # duplicates |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 13409093 | 54 | WHITE | F | 0.074882 | 0 | 0.656349 | 1 | 0.000365 | 0 | 0.001093 | 0 | 0.186414 | 1 | 0.246692 | 0 | 2 |
| 1 | 13409093 | 54 | WHITE | F | 0.103616 | 0 | 0.222099 | 1 | 0.002610 | 0 | 0.004891 | 0 | 0.297735 | 1 | 0.312928 | 0 | 2 |
| 2 | 13409093 | 54 | WHITE | F | 0.201905 | 0 | 0.651007 | 1 | 0.020195 | 1 | 0.002425 | 0 | 0.398550 | 0 | 0.165690 | 0 | 2 |
| 3 | 13409093 | 54 | WHITE | F | 0.240772 | 0 | 0.347666 | 1 | 0.028545 | 1 | 0.004719 | 0 | 0.552794 | 0 | 0.219581 | 0 | 2 |
| 4 | 13409093 | 54 | WHITE | F | 0.263834 | 0 | 0.153997 | 1 | 0.003183 | 0 | 0.012255 | 0 | 0.563608 | 0 | 0.147290 | 0 | 2 |
| 5 | 13409093 | 54 | WHITE | F | 0.291605 | 0 | 0.058406 | 1 | 0.002382 | 0 | 0.000921 | 0 | 0.316761 | 0 | 0.318836 | 0 | 2 |
| 6 | 13892369 | 18 | BLACK/AFRICAN AMERICAN | M | 0.030402 | 0 | 0.022498 | 0 | 0.000548 | 0 | 0.006008 | 0 | 0.109101 | 0 | 0.816906 | 1 | 2 |
| 7 | 13892369 | 18 | BLACK/AFRICAN AMERICAN | M | 0.068244 | 0 | 0.007198 | 0 | 0.000706 | 0 | 0.002479 | 0 | 0.108349 | 0 | 0.872852 | 1 | 2 |
| 8 | 14242488 | 52 | NaN | F | 0.318883 | 0 | 0.449131 | 0 | 0.001193 | 0 | 0.007989 | 0 | 0.234811 | 0 | 0.282035 | 1 | 2 |
| 9 | 14242488 | 52 | NaN | F | 0.377919 | 0 | 0.199191 | 0 | 0.007245 | 0 | 0.003166 | 0 | 0.385954 | 0 | 0.240554 | 1 | 2 |